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Abstract 

The ROOTED MAXIMUM LEAF OUTBRANCHING problem consists 
in finding a spanning directed tree rooted at some prescribed vertex of a 
digraph with the maximum number of leaves. Its parameterized version 
asks if there exists such a tree with at least k leaves. We use the notion of 
s — t numbering studied in [18], [5], [H] to exhibit combinatorial bounds 
on the existence of spanning directed trees with many leaves. These com- 
binatorial bounds allow us to produce a constant factor approximation 
algorithm for finding directed trees with many leaves, whereas the best 
known approximation algorithm has a vOPT- factor |10) . We also show 
that ROOTED MAXIMUM LEAF OUTBRANCHING admits a quadratic 
kernel, improving over the cubic kernel given by Fernau et al |12) . 



1 Introduction 

An outbranching of a digraph £> is a spanning directed tree in D. We consider 
the following problem: 

Rooted Maximum Leaf Outbranching: 

Input: A digraph D, an integer k, a vertex r of D. 

Output: TRUE if there is an outbranching of D rooted at r with at least 
k leaves, otherwise FALSE. 

This problem is equivalent to finding a Connected Dominating Set of size at 
most \ V{D)\ — k, connected meaning in this setting that every vertex is reachable 
by a directed path from r. Indeed, the set of internal nodes in an outbranching 
correspond to a connected dominating set. 

Finding undirected trees with many leaves has many applications in the area 
of communication networks, see [7] or [23] for instance. An extensive litterature 
is devoted to the paradigm of using a small connected dominating set as a 
backbone for a communication network. 

Rooted Maximum Leaf Outbranching is NP-complete, even restricted 
to acyclic digraphs [2], and MaxSNP-hard, even on undirected graphs |15) . 
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Two natural ways to tackle such a problem are, on the one hand, polynomial- 
time approximation algorithms, and on the other hand, parameterized complex- 
ity. Let us give a brief introduction on the parameterized approach. 

An efficient way of deahng with NP-hard problems is to identify a parameter 
which contains its computational hardness. For instance, instead of asking for 
a minimum vertex cover in a graph - a classical NP-hard optimization question 
- one can ask for an algorithm which would decide, in 0(/(A:).n'^) time for some 
fixed d, if a graph of size n has a vertex cover of size at most k. If such an 
algorithm exists, the problem is called fixed-parameter tractable, or FPT for 
short. An extensive literature is devoted to FPT, the reader is invited to read 
[9], [ig and [20]. 

Kernelization is a natural way of proving that a problem is FPT. Formally, a 
kernelization algorithm receives as input an instance (/, k) of the parameterized 
problem, and outputs, in polynomial time in the size of the instance, another 
instance {I' ,k') such that: k' < k, the size of /' only depends of k, and the 
instances (/, fc) and {I' ,k') are both true or both false. 

The reduced instance (/', k') is called a kernel. The existence of a kerneliza- 
tion algorithm clearly implies the FPT character of the problem since one can 
kernelize the instance, and then solve the reduced instance G', k' using brute 
force, hence giving an 0[f{k) + n'^) algorithm. A classical result asserts that 
being FPT is indeed equivalent to having kernelization. The drawback of this 
result is that the size of the reduced instance G' is not necessarily small with 
respect to fc. A much more constrained condition is to be able to reduce to 
an instance of polynomial size in terms of k. Consequently, in the zoology 
of parameterized problems, the first distinction is done between three classes: 
W[l]-hard, FPT, polykernel. 

A kernelization algorithm can be used as a preprocessing step to reduce the 
size of the instance before applying some other parameterized algorithm. Being 
able to ensure that this kernel has actually polynomial size in k enhances the 
overall speed of the process. See [16] for a recent review on kernelization. 

An extensive litterature is devoted to finding trees with many leaves in undi- 
rected and directed graphs. The undirected version of this problem, MAXIMUM 
Leaf Spanning Tree, has been extensively studied. There is a factor 2 ap- 
proximation algorithm for the MAXIMUM LEAF SPANNING Tree problem [21] , 
and a 3.75A: kernel [ll]. An 0*(1,94") exact algorithm was designed in [14] . 
Other graph theoretical results on the existence of trees with many leaves can 
be found in [8] and [22]. 

The best approximation algorithm known for MAXIMUM LEAF OUTBRANCHING is 
a factor \jOPT algorithm [lO]. From the Parameterized Complexity viewpoint, 
Alon et al showed that Maximum Leaf Outbranching restricted to a wide 
class of digraphs containing all strongly connected digraphs is FPT [1], and 
Bonsma and Dorn extended this result to all digraphs and gave a faster pa- 
rameterized algorithm [4]. Very recently, Kneis, Langer and Rossmanith [17] 
obtained an 0*(4'') algorithm for MAXIMUM LEAF OUTBRANCHiNG, which is 
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also an improvement for the undirected case over the numerous FPT algorithms 
designed for MAXIMUM LEAF SPANNING TREE. Fernau et al [12] proved that 
Rooted Maximum Leaf Outbranching has a polynomial kernel, exhibiting 
a cubic kernel. They also showed that the unrooted version of this problem ad- 
mits no polynomial kernel, unless polynomial hierarchy collapses to third level, 
using a breakthrough lower bound result by Bodlaender et al [3]. A linear ker- 
nel for the acyclic subcase of ROOTED MAXIMUM LEAF OUTBRANCHING and 
an 0*{3,72'^) algorithm for ROOTED MAXIMUM LEAF OUTBRANCHiNG were 
exhibited in [6]. 

This paper is organized as follows. In Section [2] we exhibit combinatorial 
bounds on the problem of finding an outbranching with many leaves. We use 
the notion of s — < numbering introduced in [18] . We next present our reduction 
rules, which are independent of the parameter, and in the following section 
we prove that these rules give a quadratic kernel. We finally present a constant 
factor approximation algorithm in Section[5]for finding directed trees with many 
leaves. 

2 Combinatorial Bounds 

Let D be a directed graph. For an arc (u, v) in D, we say that u is an in- 
neighbour of V, that V is an outneighbour of u, that {u, v) is an in-arc of v and 
an out-arc of u. The outdegree of a vertex is the number of its outneighbours, 
and its indegree is the number of its in-neighbours. An outbranching with a 
maximum number of leaves is said to be optimal. Let us denote by maxleaf{D) 
the number of leaves in an optimal outbranching of D. 

Without loss of generality, we restrict ourselves to the following. We ex- 
clusively consider loopless digraphs with a distinguished vertex of indegree 0, 
denoted by r. We assume that there is no arc (it,r) in D with u E V{D), and 
no arc {x,y) with x ^ r and y an outneighbour of r, and that r has outdegree 
at least 2. Throughout this paper, we call such a digraph a rooted digraph. 
Definitions will be made exclusively with respect to rooted digraphs, hence the 
notions we present, like connectivity and resulting concepts, do slightly differ 
from standard ones. Let -D be a rooted digraph with a specified vertex r. 

The rooted digraph D is connected if every vertex of D is reachable by a 
directed path rooted at r in D. A cut of D is a set S C V{D) — r such that 
there exists a vertex z ^ S endpoint of no directed path from r in D — S. We 
say that D is 2- connected if D has no cut of size at most 1. A cut of size 
1 is called a cutvertex. Equivalently, a rooted digraph is 2-connected if there 
are two internally vertex-disjoint paths from r to any vertex besides r and its 
outneighbours. 

We will show that the notion of s — t numbering behaves well with respect to 
outbranchings with many leaves. It has been introduced in [18] for 2-connected 
undirected graphs, and generalized in [^ by Cheriyan and Reif for digraphs 
which are 2-connected in the usual sense. We adapt it in the context of rooted 
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digraphs. 

Let D be a 2-connected rooted digraph. An r — r numbering of D is a 
Hnear ordering a of V{D) — r such that, for every vertex x ^ r, either x is 
an outneighbour of r or there exist two in-neighbours u and v of x such that 
(t(u) < (j{x) < a{v). An equivalent presentation of an r — r numbering of 
D is an injective embedding / of the graph D where r has been dupHcated 
into two vertices ri and r2, into the [0, 1] -segment of the real line, such that 
/(ri) = 0, f{r2) = 1, and such that the image by / of every vertex besides ri 
and r2 lies inside the convex hull of the images of its in-neighbours. Such convex 
embeddings have been defined and studied in general dimension by Lovasz, Linial 
and Wigderson in [19] for undirected graphs, and in [5] for directed graphs. 

Given a linear order cr on a finite set V, we denote by a the linear order on 
V which is the reverse of a. An arc uv of 13 is a forward arc if u = r or if u 
appears before v in a; uv is a backward arc if u = r or if u appears after v in a. 
A spanning out-tree T is forward if all its arcs are forward. Similar definition 
for backward out-tree. 

The following result and proof is just an adapted version of [5], given here 
for the sake of completeness. 

Lemma 1 Let D be a 2-connected rooted digraph. There exists an r — r num- 
bering of D . 

Proof: By induction over D. We first reduce to the case where the indegree of 
every vertex besides r is exactly 2. Let a; be a vertex of indegree at least 3 in 
D. Let us show that there exists an in-neighbour y of x such that the rooted 
digraph D — {y,x) is 2-connected. Indeed, there exist two internally vertex 
disjoint paths from r to x. Consider such two paths intersecting N~{x) only 
once each, and denote by D' the rooted digraph obtained from D by removing 
one arc (jj,x) not involved in these two paths. There are two internally disjoint 
paths from r to x in D' . Consider z G V(D) — r — x. Assume by contradiction 
that there exists a vertex t which cuts z from r in D' . Ast does not cut z from 
r in D and the arc {y, x) alone is missing in D' , t must cut x and not y from r 
in D' . Which is a contradiction, as there are two internally disjoint paths from 
r to X in D' . By induction, D' has an r — r numbering, which is also an r — r 
numbering for D. 

Hence, let D be a rooted digraph, where every vertex besides r has indegree 
2. As r has indegree 0, there exists a vertex v with outdegree at most 1 in D 
by a counting argument. If v has outdegree 0, then let tr be an r — r numbering 
of D — V, let ui and U2 be the two in-neighbours of v. Insert v between ui and 
U2 in a to obtain an r — r numbering of D. Assume now that v has a single 
outneighbour u. Let w be the second in-neighbour of u. Let D' be the graph 
obtained from D by contracting the arc {v, u) into a single vertex uv. As D' is 
2-connected, consider by induction an r — r numbering a of D' . Replace uv by 
u. It is now possible to insert v between its two in-neighbours in order to make 
it so that u lies between v and w. Indeed, assume without loss of generality 
that w is after uv in a. Consider the smallest in-neighbour t of w in cr. As it 
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is an r — r numbering oi D\ t lies before uv in a. We insert v just after t to 
obtain an r — r numbering of D. □ 

Note that an r — r numbering cr of D naturally gives two acyclic covering 
subdigraphs of D, the rooted digraph consisting of the forward arcs of 
D, and the rooted digraph D\g consisting of the backward arcs of D. The 
intersection of these two acyclic digraphs is the set of out-arcs of r. 

Corollary 1 Let D be a 2-connected rooted digraph. There exists an acyclic 
connected spanning subdigraph A of D which contains at least half of the arcs 
ofD-r. 

Let G be an undirected graph. A vertex cover of G is a set of vertices covering 
all edges of G. A dominating set of G is a set C such that for every vertex 

has a neighbour in S. A strongly dominating set of G is a set S' C 
such that every vertex has a neighbour in S. 

Let be a rooted digraph. A strongly dominating set of I? is a set 5 C 
such that every vertex besides r has an in-neighbour in S. We need the following 
folklore result: 

Lemma 2 Any undirected graph G onn vertices and m arcs has a vertex cover 
of size 

Proof: By induction on n + to. If there exists a vertex of degree at least 2 in 
G, choose it in the vertex cover, otherwise choose any non-isolated vertex. □ 

Lemma 3 Let G be a bipartite graph over AUB, with d{a) ~ 2 for every a G A. 
There exists a subset of B dominating A with size at most 

Proof: Let G' be the graph which vertex set is B, and where (6, h') is an arc if b 
and h' share a common neighbour in A. The result follows from Lemma [2] since 
G' has \A\ arcs and \B\ vertices. □ 

Corollary 2 Let D be an acyclic rooted digraph with I vertices of indegree at 
least 2 and with a root of outdegree d{r) > 2. Then D has an outbranching with 
at least ^if^fci — i + l leaves. 

Proof: Denote by n the number of vertices of D. For every vertex v of indegree 
at least 3, delete incoming arcs until v has indegree exactly 2. Since D is acyclic, 
it has a sink s. 

Let Z be the set of vertices of indegree 1 in Z?, of size n — 1 — Let Y 
be the set of in-neighbours of vertices of Z, of size at most n — 1 — I. Let A' 
be the set of vertices of indegree 2 dominated by Y. Let B ~ V{D) — Y~s. 
Let A be the set of vertices of indegree 2 not dominated by Y . Note that Y 
cannot have the same size as Z \J A' . Indeed, Z contains the outneighbours 
of r, and hence Y contains r, which has outdegree at least 2. More precisely, 
|y| + d{r) -\<\Z\JA'\. As B = V{D) ~Y-sandA = V{D) -A'-Z-r, 
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Figure 1: The "boloney" graph Dq 



we have that \B\ > \A\ + d(r) — 1. Moreover, as Y has size at most n — 1 — Z, we 
have that \B\ > I. Consider a copy Ai of A and a copy Bi o{ B. Let G be the 
bipartite graph with vertex bipartition and where (&, a), with a € Ai 

and 6 £ is an edge if (6, a) is an arc in D. By Lemma [3] apphed to G, there 
exists a set X C _B of size at most ^^^1^^"^^ < — ^^^^^^ — ^ which dominates A in 
D. The set C = X U F strongly dominates V{D) — r in D, and has size at most 

^ ^ 2\B\-(d(r)-l) ^ ^ ^ _ \B\+d{r)-l ^ |y| + |5| = „, _ ^ 

and \B\ > Z, this yields \X U Y\ < n — 1 — '■+^0 )-^ ^ jj jg acycHc, any set 
strongly dominating V — r contains r and is a connected dominating set. Hence 
there exists an outbranching T of D having a subset of C as internal vertices. 
T has at least '+'^(^)~^ _(_ i leaves. 

□ 

This bound is tight up to one leaf. The rooted digraph depicted in 
Figure 1 is 2-connected, has 3k — 2 vertices of indegree at least 2, d{r) = 3 and 
maxleaf{Dk) = k + 2. 

Finally, the following combinatorial bound is obtained: 

Theorem 1 Let D be a 2-connected rooted digraph with I vertices of indegree 
at least 3. Then maxleaf{D) > |. 

Proof: Apply Corollary [2] to the rooted digraph with the larger number of ver- 
tices of indegree 2 among D„ and D^. □ 

An arc is simple if does not belong to a 2-circuit. A vertex v is nice if it is 
incident to a simple in-arc. 

The second combinatorial bound is the following: 

Theorem 2 Let D he 2-connected rooted digraph. Assume that D has I nice 
vertices. Then D has an outbranching with at least ^ leaves. 

Proof: By Lemma [H we consider an r — r numbering a of D. For every nice 
vertex v (incident to some in-arc a) with indegree at least three, delete incoming 
arcs of V different from a until v has only one incoming forward arc and one 
incoming backward arc. For every other vertex of indegree at least 3 in D, delete 
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incoming arcs of v until v has only one incoming forward arc and one incoming 
backward arc. At the end of this process, a is still an r — r numbering of the 
digraph D, and the number of nice vertices has not decreased. 

Denote by Tf the set of forward arcs of D, and by Tf, the set of backward 
arcs of D. As cr is an r — r numbering of D, Tf and Tf, are spanning trees of D 
which partition the arcs of 13 — r. 

The crucial definition is the following: say that an arc uv of Tf (resp. of 
Tf,), with M ^ r, is transverse if u and v are incomparable in Tf, (resp. in Tf), 
that is if V is not an ancestor of u in Tt (resp. in Tf). Observe that u cannot be 
an ancestor of v in Tf, (resp. in Tf) since Tt is backward (resp. Tf is forward) 
while uv is forward (resp. backward) and u ^ r. 

Assume without loss of generality that Tf contains more transverse arcs than 
Tb. Consider now any planar drawing of the rooted tree Tt. We will make use 
of this drawing to define the following: if two vertices u and v are incomparable 
in Tb, then one of these vertices is to the left of the other, with respect to our 
drawing. Hence, we can partition the transverse arcs of Tf into two subsets: 
the set Si of transverse arcs uv for which v is to the left of u, and the set Sr 
of transverse arcs uv for which v is to the right of u. Assume without loss of 
generality that \Si\ > \Sr\- 

The digraph Ti, U Si is an acyclic digraph by definition of Si. Moreover, it 
has IS*;! vertices of indegree two since the heads of the arcs of IS*;] are pairwise 
distinct. Hence, by Corollary O T(, U Si has an outbranching with at least 
\Si\+d{r)-i _|_ ^ jgg^ygg^ hence so does D. 

We now give a lower bound on the number of transverse arcs in D to bound 
\Si\. Consider a nice vertex v in D, which is not an outneighbour of r, and with 
a simple in-arc uv belonging to, say, Tf. If uv is not a transverse arc, then v is 
an ancestor of w in T^. Let w be the outneighbor of v on the path from w to u 
in Tfc. Since uv is simple, the vertex w is distinct from u. No path in Tf goes 
from w to V, hence vw is a transverse arc. Therefore, we proved that v (and 
hence every nice vertex) is incident to a transverse arc (either an in-arc, or an 
out-arc). Thus there are at least tz^ll transverse arcs in D. 

Finally, there are at least transverse arcs in Tf, and thus IS"/] > ^—y^- 

In all, D has an outbranching with at least ^ leaves. □ 

As a corollary, the following result holds for oriented graphs (digraphs with 
no 2-circuit): 

Corollary 3 Every 2-connected rooted oriented graph on n vertices has an out- 
branching with at least ^^^^ leaves. 

3 Reduction Rules 

We say that P = {xi, . . . ,xi}, with / > 3, is a bipath of length ^ — 1 if the 
set of arcs adjacent to {x2, ■ ■ ■ ,xi-i} in D is exactly {{xi,Xi+i), {xi+i, Xi)\i G 
{l,...,l~l}}. 
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To exhibit a quadratic kernel for ROOTED MAXIMUM LEAF OUTBRANCHING, 
we use the following four reduction rules: 

(0) If there exists a vertex not reachable from r in D, then reduce to a trivially 
FALSE instance. 

(1) Let x be a cutvertex of D. Delete vertex x and add an arc (w, z) for every 
V e N~{x) and z g N+{x) - v. 

(2) Let P be a bipath of length 4. Contract two consecutive internal vertices 
of P. 

(3) Let a; be a vertex of D. If there exists y £ N~{x) such that N~{x) — y 
cuts y from r, then delete the arc {y,x). 

Note that these reduction rules are not parameter dependent. Rule (0) only 
needs to be applied once. 

Observation 1 Let S be a cutset of a rooted digraph D. Let T he an outhranch- 
ing of D. There exists a vertex in S which is not a leaf in T . 

Lemma 4 The above reduction rules are safe and can he checked and applied 
in polynomial time. 

Proof : 

(0) Reachability can be tested in linear time. 

(1) Let X be a cutvertex oiD. Let D' be the graph obtained from D by deleting 
vertex x and adding an arc {v, z) for every v g N~ (x) and z g {x) — v. 
Let us show that maxleaf(£') = maxleaf(D'). Assume T is an outbranching 
of D rooted at r with k leaves. By Observation [1] x is not a leaf of T. 
Let f{x) be the father of x in T. Let T' be the tree obtained from T by 
contracting x and f{x). T' is an outbranching of D' rooted at r with k 
leaves. 

Let T' be an outbranching of D' rooted at r with k leaves. N~{x) is a cut 
in D' , hence by Observation [T] there is a non-empty collection of vertices 
yi, . . . ,yi g N~{x) which are not leaves in T' . Choose yi such that yj is 
not an ancestor of yi for every j g {1, . . . ,1} — {i} . Let T be the graph 
obtained from T' by adding x as an isolated vertex, adding the arc (j/i, x), 
and for every j g {1, .. .J}, for every arc {yj,z) g T with z g iV+(a;), 
delete the arc (jjj, z) and add the arc {x, z). As yi is not reachable in T' 
from any vertex y g N^{x) — yi, there is no cycle in T. Hence T is an 
outbranching of D rooted at r with at least k leaves. Moreover, deciding 
the existence of a cut vertex and finding one if such exists can be done in 
polynomial time. 
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(2) Let P be a bipath of length 4. Let u, x, y, z and t be the vertices of P in 
this consecutive order. Let D' be the rooted digraph obtained from D by 
contracting x and y. Let T be an outbranching of D. Let T' be the rooted 
digraph obtained from T by contracting y with its father in T. T' is an 
outbranching of D' with as many leaves as T. Let T' be an outbranching 
of D' . If the father of xy in T' is z, then T' — (z, a;?/) U (z, y) U (y, a:) is an 
outbranching of D with at least as many leaves as T' . If the father of xy 
in T' is u, then T' — (u, xy) U (m, x) U {x, y) is an outbranching of D with 
at least as many leaves as T'. 

(3) Let X be a vertex of I? . Let y G (x) be a vertex such that (a;)— y cuts 
y from r. Let D' be the rooted digraph obtained from T by deleting the 
arc (y, x). Every outbranching of D' is an outbranching of D. Let T be an 
outbranching of D containing (y, x). There exists a vertex z € N~{x) — y 
which is an ancestor of x. Thus T — (j/,x) U {z,x) is an outbranching of 
D' with at least as many leaves as T. 

□ 

We apply these rules iteratively until reaching a reduced instance, on which 
none can be applied. 

Lemma 5 Let D be a reduced rooted digraph with a vertex of indegree at least 
k. Then D is a TRUE instance. 

Proof: Assume x is a vertex of D with in-neighbourhood N~{x) ~ {ui, . . . ,ui}, 
with I > k. For every i G {1, . . . , Z}, N~(x) — Ui does not cut Ui from r. Thus 
there exists a path Pi from r to Ui outside N~{x) — Ui. The rooted digraph 
D' = ^ie{i,....i}Pi is connected, and for every i e {!,...,/}, u,; has outdegree 
in D'. Thus D' has an outbranching with at least k leaves, and such an 
outbranching can be extended into an outbranching of D with at least as many 
leaves. □ 

4 Quadratic kernel 

In this section and the following, a vertex of a 2-connected rooted digraph D is 
said to be special if it has indegree at least 3 or if one of its incoming arcs is 
simple. A non special vertex is a vertex u which has exactly two in-neighbours, 
which are also outneighbours of u. A weak bipath is a maximal connected set of 
non special vertices. If P = {xi , . . . , x;} is a weak bipath, then the in-neighbours 
of Xi, for z = 2, . . . , ^ — 1 in -D are exactly Xi-i and Xi+i. Moreover, xi and xi 
are each outneighbour of a special vertex. Denote by s(P) the in-neighbour of 
Xi which is a special vertex. 

This section is dedicated to the proof of the following statement: 

Theorem 3 A digraph D of size at least {3k — 2)(30fc — 2) reduced under the 
reduction rules of previous section has an outbranching with at least k leaves. 
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Proof : By Theorem [T] and Theorem [2l if there are at least 6k + 24k — 1 
special vertices, then D has an outbranching with at least k leaves. Assume 
that there are at most SO/c — 2 special vertices in D. 

As D is reduced under Rule (2), there is no bipath of length 4. We can 
associate to every weak bipath S of of length t a set Ab of [</3] out- 
arcs toward special vertices. Indeed, let P = {xi, . . . ,xi) be a weak bipath 
of D. For every three consecutive vertices Xi, Xi^i, Xi^2 of P, 2 < i < Z — 3, 
{xi-i, Xi, Xi+i, Xi+2, Xi+s) is not a bipath by Rule (2), hence there exists an arc 
{xj,z) with j = + l or i + 2 and z ^ P. Moreover z must be a special vertex 
as arcs between non-special vertices He within their own weak bipath. The set 
of these arcs {xj, z) has the presribed size. 

By Lemma \5[ any vertex in D has indegree at most fc — 1 as D is reduced 
under Rule (3), hence there are at most 3{k — 1)(30A: — 2) non special vertices 
in D. □ 

To sum up, the kernelization algorithm is as follows: starting from a rooted 
digraph D, apply the reduction rules. Let D' be the obtained reduced rooted 
digraph. If D has size more than (3k — 2)(30A: — 2), then reduce to a trivially 
TRUE instance. Otherwise, D' is an instance equivalent to D of size quadratic 
in k. 

Our analysis for this quadratic kernel for ROOTED MAXIMUM LEAF OUTBRANCHiNG 
is actually tight up to a constant factor. Indeed, the following graph T; is re- 
duced under the reduction rules stated on Section [3] and has a number of ver- 
tices quadratic in its maximal number of leaves. Let V = {vij\i = 
j = 1, . . . , 3(Z — 1)}. For every i = 1, . . . ,1, (r, Vi^i) is an arc of T. For ev- 
ery j = 1, . . . , 3Z — 2, i = {vij,Vij+i) is a 2-circuit of T/. For every 
i = 1, . . . (wi,3i-i,i'j+i[i],3Z-i) is an arc of T/. For every t = - 1, 
i = 1, . . . (wi,3t, fi+tii],!) is an arc of Ti. This digraph Ti is reduced under 
the reduction rules of Section [H and maxleaf{Ti) = 2{l — 1). Finally, Ti has 
3l{l — I) + 1 vertices. 

Note that this graph has many 2-circuits. We are not able to deal with 
them with respect to kernelization. For the approximation on the contrary, we 
are able to deal with the 2-circuits to produce a constant factor approximation 
algorithm. 

5 Approximation 

Let us first point out that the reduction rules described in Section[4] directly give 
an approximation algorithm asymptotically as good as the best known approxi- 
mation algorithm [lO]. Indeed, as these rules are independant of the parameter, 
and as our proof of the existence of a solution of size k when the reduced graph 
has size more than 3{k — l)(30fc — 2) is contractive, this yields a 0{vOPT) 
approximation algorithm. Let us sketch this approximation algorithm. Start by 
applying the reduction rules described in Section |4] to the input rooted digraph. 
This does not change the value of the problem. Let m be the size of the reduced 
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graph. Exhibit an outbranching with at least leaves as in the proof of 

Theorem [51 Finally, undo the sequence of contractions yield by the application 
of reduction rules at the start of the algorithm, repairing the tree as in the proof 
of Lemma [H The tree thus obtained has at least leaves, while the tree 

with maximum number of leaves in the input graph has at most to — 1 leaves. 
Thus this algorithm is an 0(V OPT) approximation algorithm. 

Let us describe now our constant factor approximation algorithm for ROOTED 
Maximum Leaf Outbranching, being understood that this also gives an ap- 
proximation algorithm of the same factor for MAXIMUM LEAF OUTBRANCHING as 
well as for finding an out-tree (not necessarily spanning) with many leaves in a 
digraph. 

Given a rooted digraph D" , apply exhaustively Rule (1) of Section [3l The 
resulting rooted digraph D is 2-connected. By Lemma IH maxleaf{D") = 
maxleaf{D). 

Let us denote by Dns the digraph D restricted to non special vertices. Recall 
that Dns is a dijoint union of bipaths, which we call non special components. A 
vertex of outdegree 1 in Dns is called an end. Each end has exactly one special 
vertex as an in-neighbour in D. 

Theorem 4 Let D be a 2-connected rooted digraph with I special vertices and 
h non special components. Then max{^, h — I) < maxleaf{D) < I + 2h. 

Proof: The upper bound is clear, as at most two vertices in a given non special 
component can be leaves of a given outbranching. The first term of the lower 
bound comes from Theorem [T] and Theorem [2l To establish the second term, 
consider the digraph D' which vertices are the special vertices of D and r. For 
every non special component of D, add an edge in D' between the special in- 
neighbours of its two ends. Consider an outbranching of D' rooted at r. This 
outbranching uses I — 1 edges in D' , and directly corresponds to an out-tree T 
in D. Extend T into an outbranching T o{ D. Every non special component 
which is not used in T contributes to at least a leaf in T, which concludes the 
proof. □ 

Consider the best of the three outbranchings of D obtained in polynomial 
time by Theorem [1] Theorem [2] and Theorem [H This outbranching has at least 
max (j^, ft. — I) leaves. The worst case is when ~ h — I. In this case, the 
upper bound becomes: hence we have a factor 92 approximation algorithm 
for Rooted Maximum Leaf Outbranching. 

6 Conclusion 

We have given a quadratic kernel and a constant factor approximation algorithm 
for Rooted Maximum Leaf Outbranching: reducing the gap between the 
problem of finding trees with many leaves in undirected and directed graphs. 
Maximum Leaf Spanning Tree has a factor 2 approximation algorithm, and 
Rooted Maximum Leaf Outbranching now has a factor 92 approximation 



11 



algorithm. Reducing this 92 factor into a small constant is one challenge. The 
gap now essentially lies in the fact that MAXIMUM LEAF SPANNING TREE has a 
Hnear kernel while ROOTED MAXIMUM LEAF OUTBRANCHING has a quadratic 
kernel. Deciding whether ROOTED MAXIMUM LEAF OUTBRANCHiNG has a 
Hnear kernel is a challenging question. Whether long paths made of 2-circuits 
can be dealt with or not might be key to this respect. 
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